Optimal classifier for imbalanced data using Matthews Correlation Coefficient metric
نویسندگان
چکیده
منابع مشابه
Optimal classifier for imbalanced data using Matthews Correlation Coefficient metric
Data imbalance is frequently encountered in biomedical applications. Resampling techniques can be used in binary classification to tackle this issue. However such solutions are not desired when the number of samples in the small class is limited. Moreover the use of inadequate performance metrics, such as accuracy, lead to poor generalization results because the classifiers tend to predict the ...
متن کاملMining Imbalanced Data with Learning Classifier Systems
This chapter investigates the capabilities of XCS for mining imbalanced datasets. Initial experiments show that, for moderate and high class imbalances, XCS tends to evolve a large proportion of overgeneral classifiers. Theoretical analyses are developed, deriving an imbalance bound up to which XCS should be able to differentiate between accurate and overgeneral classifiers. Some relevant param...
متن کاملAbsent data generating classifier for imbalanced class sizes
We propose an algorithm for two-class classification problems when the training data are imbalanced. This means the number of training instances in one of the classes is so low that the conventional classification algorithms become ineffective in detecting the minority class. We present a modification of the kernel Fisher discriminant analysis such that the imbalanced nature of the problem is e...
متن کاملA Correlation for Estimating LCPC Abrasivity Coefficient using Rock Properties
Rock abrasivity, as one of the most important parameters affecting the rock drillability, significantly influences the drilling rate in mines. Therefore, rock abrasivity should be carefully evaluated prior to selecting and employing drilling machines. Since the tests for a rock abrasivity assessment require sophisticated laboratory equipment, empirical models can be used to predict rock a...
متن کاملModified Sampling Strategies Using Correlation Coefficient for Estimating Population Mean
This paper proposes two sampling strategies based on the modified ratio estimator using the population mean of auxiliary variable and population correlation coefficient between the study variable and the auxiliary variable by Singh and Tailor (2003) for estimating the population mean (total) of the study variable in a finite population. A comparative study is made with usual sampling strategies...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: PLOS ONE
سال: 2017
ISSN: 1932-6203
DOI: 10.1371/journal.pone.0177678